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Abstract 

Single-molecule spectroscopy has developed into a widely used method for probing the structure, dynamics, and 
mechanisms of biomolecular systems, especially in combination with Forster resonance energy transfer (FRET). In 
this introductory tutorial, essential concepts and methods will be outlined, from the FRET process and the basic 
considerations for sample preparation and instrumentation to some key elements of data analysis and photon 
statistics. Different approaches for obtaining dynamic information over a wide range of timescales will be explained 
and illustrated with examples, including the quantitative analysis of FRET efficiency histograms, correlation 
spectroscopy, fluorescence trajectories, and microfluidic mixing. 

Single-molecule spectroscopy fluorescence, Forster resonance energy transfer, correlation functions, fluorescence correlation spectroscopy, 
confocal detection, protein dynamics, protein folding 



Introduction 

Single-molecule spectroscopy has become an integral 
part of biophysical research and nanobiotechnology, and 
a wide range of biological questions are now addressed 
with these methods [1], including the mechanisms of 
molecular machines [2-4], protein-nucleic acid interac- 
tions [5,6], enzymatic reactions [7-9], and protein or 
RNA folding [10,11], to name but a few. A key advan- 
tage of single-molecule experiments is the possibility to 
avoid ensemble averaging. Instead of obtaining observa- 
bles that constitute an average over a large number of 
molecules in the sample, as it is the case with most con- 
ventional methods, information is extracted from the 
molecules one by one. This often allows direct access to 
the distributions of the underlying molecular properties, 
e.g., the number of thermodynamic states populated and 
the distributions of intramolecular distances or stoichio- 
metries. Another advantage of single-molecule methods 
is that dynamic information can often be obtained from 
equilibrium measurements, e.g. via correlation functions 
[12-16]; the analysis of broadening and exchange 
between subpopulations in Forster resonance energy 
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transfer (FRET) efficiency histograms [17-20]; or directly 
from fluorescence trajectories of immobilized molecules 
[16,19,21-26]. 

Since the use of these methods has been spreading 
rapidly, a wide range of literature has become available on 
the topic, including several textbooks that describe the 
technical and conceptual details of single-molecule spec- 
troscopy [1,27,28]. Here, I will briefly summarize the key 
aspects of single-molecule fluorescence spectroscopy on 
an introductory level, with a focus on the investigation of 
protein structure and dynamics with single-molecule 
FRET. I will assume basic knowledge of fluorescence spec- 
troscopy [29] and biomolecular structure and function; 
beyond that, the principles will be covered step by step, 
with the goal of familiarising the reader with the type of 
information available from single-molecule experiments, 
including potential pitfalls and limitations, and illustrating 
the capabilities of the method. 

This article is based on a tutorial held by the author at 
the CNRS School "Nanophysics for Health" in November 
2012. The examples used to illustrate the concepts are 
thus primarily taken from the research of the author. For 
more representative overviews of the wide range of cur- 
rent applications and recent advances in single-molecule 
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spectroscopy, I refer the reader to more comprehensive 
recent reviews [6,11,30-32]. 

From ensembles to single molecules 

Forster resonance energy transfer (FRET) 

FRET is an attractive method for probing distances and 
distance dynamics in and between biological macromo- 
lecules (such as proteins and nucleic acids) because it is 
most sensitive in the range of a few nanometres, the 
typical size of these cellular machines. The theoretical 
basis of this process was developed already in the 1940s. 
Theodor Forster [33,34] showed that the rate of excita- 
tion energy transfer, k^, between a suitable donor and 
acceptor chromophore is proportional to the inverse 6'^ 
power of the distance separating them. 



(1) 



where /cd'^ is the excited state lifetime of the donor in 
the absence of the acceptor (Figure 1); r is the distance 
between donor and acceptor; and Rq, the Forster radius, 
is a proportionality constant that depends on the inter- 
action between the transition dipoles of donor and 
acceptor. Rq is calculated as 
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where / is the overlap integral between the donor 
emission and the acceptor absorption spectra; Qd is the 



donor fluorescence quantum yield; n is the refractive 
index of the medium between the dyes; and A/^a is 
Avogadro's constant, depends on the relative orienta- 
tion of the chromophores, with 
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where 0t is the angle between the donor and accep- 
tor transition dipoles, and 0d and 9^ are the angles 
between the transition moments and the line connect- 
ing the centres of donor and acceptor, respectively. In 
cases where the rotational reorientation of the chro- 
mophores is fast compared to the donor excited state 
lifetime, k averages to a value of 2/3, which simplifies 
the application of FRET significantly. The probability 
that a photon absorbed by the donor will lead to 
energy transfer to the acceptor, called the FRET effi- 
ciency, E, is given by k^/{k^ + A^j), which is determined 
experimentally either by counting the number of 
emitted donor and acceptor photons or by measuring 
the donor lifetime in the presence and absence of 
acceptor (see below). The beauty of Forster theory is 
that Rq can be directly obtained, without any theoreti- 
cal calculations, from readily measurable spectroscopic 
quantities. 

One of the first quantitative tests of Forster's theory 
and the key experiment that established FRET as a 
method for the investigation of biomolecules was pub- 
lished by Stryer and Haugland in 1967 [35]. They 
found the dependence of the transfer efficiency on the 
inter-chromophore distance to be in agreement with 




Figure 1 Forster resonance energy transfer (FRET), (a) A molecule labelled with donor and acceptor chromophores (in this case a stiff 
polyproline peptide with Alexa Fluors 488 and 594 [86]). The donor fluorophore (green) can be excited specifically (blue). It can then either emit 
a fluorescence photon itself or transfer its excitation energy to the acceptor (red), (b) The process can be depicted in terms of a Jablonsky 
diagram illustrating the transitions between ground and excited states of donor (D) and acceptor (A). The rate of energy transfer, kf, depends on 
the distance, r, between donor and acceptor, resulting in the characteristic distance dependence of the transfer efficiency E (c). At the Forster 
distance, Rq, E = 0.5. For currently available FRET pairs suitable for single molecule spectroscopy, Rq is in the range of 4 to 8 nm. 
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Forster's famous result [33], according to which 
(Figure 1) 



Consequently, the Forster radius is the characteristic 
distance that results in a transfer efficiency of 1/2. The 
idea of this "spectroscopic ruler" [35] has had a large 
impact on the investigation of biomolecular structure 
and dynamics on distances in the range of about 1 to 10 
nm [29,36-38]. Initially, all of these measurements were 
done in "ensemble experiments", i.e. by interrogating 
samples with large numbers of molecules in steady-state 
or time-resolved fluorometers. About 30 years after the 
work of Stryer and Haugland, the first single-molecule 
FRET measurements were reported by Ha et al. [39]. 
But what does it take to do single-molecule fluorescence 
and FRET experiments? 

Basic considerations 

Already in the 1970s, it became possible to measure 
fluorescence from single atoms in dilute atomic beams 
[40], i.e. in the gas phase, where the background pro- 
blem is minimal. Observing single molecules in the con- 
densed phase is much more challenging because 
Rayleigh and Raman scattering result in a large back- 
ground signal. Additionally, the ubiquity of contami- 
nants makes great demands on the purity of the matrix. 
Finally, an atom in vacuum is a chemically very stable 
system, even in its excited state, whereas fluorescent 
molecules in the condensed phase survive only a limited 
number of excitation-emission cycles before they are 
irreversibly destroyed, typically by a chemical reaction 
with other molecules, in a process termed photobleach- 
ing. Single molecule detection in a solid or liquid matrix 
(an aqueous solution in the case of most biomolecular 
experiments) therefore requires additional measures. 
First of all, the method must provide a signal from an 
individual molecule that exceeds the background caused 
by the large number of solvent molecules. 

This requirement makes fluorescence a very attractive 
approach, where a dye molecule resonantly interacts 
with the excitation light [41-43]. Due to the Stokes shift, 
the emitted light can be selected spectrally from scat- 
tered light, e.g. with interference filters (Figure 2a), 
which can provide ratios of transmitted and reflected 
intensities at different wavelengths in excess of 10 . 
Since the background signal from solvent is proportional 
to the number of solvent molecules contained in the 
total observed volume, a common strategy for the detec- 
tion of signal from individual molecules is to reduce the 
size of the detection volume as much as possible. Such 
spatial selection can be achieved, for instance, by very 



tightly focusing a laser beam into the sample, combined 
with confocal detection [44], or by total internal reflection 
fluorescence (TIRF) [45], which allows excitation of a 
~200 nm layer of the sample in the evanescent field of a 
reflected laser beam. A further reduction of the observa- 
tion volume can be achieved with zero mode waveguides, 
which allow measurements at higher concentrations of 
fluorescent sample [46]. Here, I will focus on confocal 
fluorescence detection because it is a commonly used 
method; currently allows the broadest range of accessible 
timescales; and enables the most complete access to the 
photon statistics of the fluorescence process. 

Fluorescence labelling 

To be able to observe individual biomolecules, we first 
need to furnish them with suitable fluorophores. Since 
even tryptophan, the natural amino acid with the high- 
est fluorescence quantum yield (-0.13), is not suitable 
for single molecule detection owing to its low photo- 
stability, and since fluorescent proteins [47] are still of 
limited use for the investigation of biomolecular struc- 
ture and dynamics because of their relatively large size 
and their inferior photophysical properties, labelling 
with extrinsic fluorophores absorbing and emitting in 
the visible range of the spectrum is usually unavoidable 
for single-molecule spectroscopy. For FRET, two (or 
more) chromophores are needed, and their specific pla- 
cement in the protein ideally requires groups with 
orthogonal chemistry. A wide range of labelling strate- 
gies exist, ranging from nonspecific labelling of amino 
or thiol groups present in the natural polypeptide [48] 
to the incorporation of non-natural amino acids [49], 
advanced chemical ligation [50], and intein-based 
approaches [51]. I refer the reader to reviews of the 
topic for a more detailed treatment [48,52,53]. 

Currently, the simplest and most common approach 
for labelling proteins is still to rely on cysteine derivati- 
sation using maleimide chemistry. Increased specificity 
can be achieved by removing unwanted natural cysteines 
by site-directed mutagenesis or by introducing cysteines 
with different reactivity owing to different molecular 
environments within the protein [54]. Labelling is 
usually combined with multiple chromatography steps 
to purify the desired adducts. If higher specificity is 
required, e.g. for FRET with more than two fluorophores 
[55], cysteine labelling can be combined with orthogonal 
chemistries, such as non-natural amino acid incorpora- 
tion [56]. A wide variety of suitable organic dyes with 
various functional groups for protein labelling have 
become commercially available. Examples of particularly 
popular chromophores for single molecule FRET are the 
cyanine dyes [57] or the Alexa Fluor series [58], which 
meet the key requirements for single-molecule fluores- 
cence of biomolecules: high extinction coefficients and 
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Figure 2 Overview of instrumentation and data reduction in confocal single-molecule spectroscopy, (a) The scheme illustrates the main 
components of a 4-channel confocal single molecule instrument that collects fluorescence photons separated by polarization and wavelength 
and records their individual arrival times, (b) Illustration of a FRET-iabelled protein present in solution in an equilibrium between folded and 
unfolded states, (c) Small part of a binned trajectory of detected photons recorded from molecules freely diffusing in solution (in this example 
Csprm in 1.5 M GdmCI [13]), where each fluorescence burst corresponds to an individual molecule traversing the diffraction-limited confoca 
volume, (d) Transfer efficiency histogram and 2-dimensional histogram of the relative donor fluorescence lifetime, roVro. versus transfer 
efficiency, E, calculated from individual bursts, resulting in subpopulations that can be assigned to the folded and unfolded protein, and 
molecules without active acceptor at f = 0 (shaded in grey). The straight and curved red lines correspond to the relation between roA/ro and E 
expected for the case of a single fixed distance (Eq. 6) and the broad distance distribution of a Gaussian chain [13], respectively, (e) 
Subpopulation-specific time-correlated single photon counting histograms for donor and acceptor photons from all bursts assigned to unfolded 
molecules, and subpopulation-specific donor intensity correlation function, in this case reporting on the nanosecond reconfiguration dynamics of 
the unfolded protein. 



fluorescence quantum yields, high photostability, and 
sufficient solubility in aqueous solutions. 

Confocal single-molecule spectroscopy 

Let us now assume that we have a suitably labelled mole- 
cule available, which carries at specific positions of our 
protein a donor and an acceptor fluorophore suitable for 
FRET (Figure 2b). In the simplest type of experiment, 
these molecules are diffusing freely in solution. In a con- 
focal instrument [59], we achieve single-molecule detec- 
tion in the following way (Figure 2a): a laser beam is 
focused into the sample solution to a diffraction-limited 



spot with a high numerical aperture objective. The small 
size of the resulting excitation volume (together with the 
confocal pinhole, see below) provides the requisite spatial 
selection, with an observation volume of ~lfl. Given the 
low concentration of fluorescently labelled molecules in 
solution (typically in the 10 to 100 pM range), the prob- 
ability of two molecules residing in the confocal volume 
at the same time becomes negligible. 

We use a laser wavelength for excitation that is reso- 
nant with the donor fluorophore of our FRET-labelled 
protein. If now a molecule happens to diffuse into the 
excitation volume, the donor dye will get electronically 
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excited and can relax back to the ground state either by 
emitting a photon itself or by transferring its energy to 
the second fluorophore attached to our protein, the 
acceptor dye (Figure 2b). Owing to the pronounced dis- 
tance dependence of coupling between donor and 
acceptor (Figure 1), the probability of energy transfer 
and thus the numbers of donor and acceptor photons 
emitted are highly sensitive to the interdye distance. In 
other words, we can obtain information about intramo- 
lecular distances by counting the numbers of photons 
emitted by donor and acceptor. 

Emitted photons are collected by the same objective 
used for focusing the laser (epifluorescence). A dichroic 
mirror that reflects the laser light but transmits the 
emitted fluorescence, in combination with suitably cho- 
sen filters, provides the necessary spectral separation 
between fluorescence emitted by our labelled molecule 
and Ught scattered by the solvent (Figure 2a). A confocal 
pinhole in the image plane of the objective serves as a 
spatial filter to remove out-of-focus light and thus com- 
pletes the spatial selection of the confocal detection 
scheme; except for a few background photons, we are 
now left only with fluorescence photons resulting from 
the FRET process. In the next steps, these photons are 
first sorted by their polarization using a polarizing beam 
splitter, and then by their wavelength, to distinguish 
donor and acceptor emission. The light is focused on 
highly sensitive detectors, typically avalanche photo- 
diodes (APDs), where the individual photons trigger 
electronic pulses that are recorded by suitable counting 
electronics, in modern instruments with a time resolu- 
tion limited by the jitter of the detectors, which is typi- 
cally in the 50 ps range. In such an instrument, the time 
stamps of all detected photons, together with the corre- 
sponding detection channels, are stored for subsequent 
processing. If pulsed laser excitation is used, the time of 
the exciting laser pulse is also recorded. 

Basics of data analysis 

From these records we thus have information on the 
colour, polarization, and arrival time of every individual 
photon, both on an absolute time scale and relative to 
the exciting laser pulse. The number of vertically and 
horizontally polarized photons can be used to calculate 
fluorescence polarization or anisotropy and thus reports 
on the rotational mobility of the fluorophores, a key 
aspect for quantitatively relating transfer efficiencies to 
distances [34]; the arrival time of a photon relative to 
the exciting laser pulse reports on the fluorescence life- 
time of the fluorophore, a quantity that is complemen- 
tary to ratiometric transfer efficiencies [60]; and the 
absolute arrival times can be employed to calculate cor- 
relation functions from picoseconds to seconds [61,62]. 
This type of data acquisition, which allows a wide range 



of derived quantities to be obtained from the data, is 
also termed multiparameter fluorescence detection [63]. 

A simple and common first step in analysing the data 
is to plot the number of donor and acceptor photons 
detected in a suitably chosen time bin, e.g. 1 ms. A typi- 
cal result is shown in Figure 2c, where bins with very 
low signal (corresponding to the background) are inter- 
rupted by bursts of photons originating from individual 
molecules diffusing through the confocal observation 
volume. The duration of these fluorescence bursts is 
determined by the time it takes a molecule to diffuse 
through the confocal volume, which in many typical 
applications for biomolecules is on the order of 1 ms; 
during each burst, typically up to several hundred 
photons can be detected. The bursts can be identified 
by some threshold criterion (e.g. that more than 100 
photons are detected in a millisecond bin, but more 
sophisticated algorithms are frequently employed [64]), 
and each of them can be analysed individually, e.g. in 
terms of the transfer efficiency of the respective mole- 
cule. The most widely used procedure is to calculate the 
transfer efficiencies ratiometrically [65] based on the 
number of detected photons from donor and acceptor, 
«D and «A. respectively: 



(«D and Ha need to be corrected for the differences in 
quantum yields of the dyes, the efficiencies of the detec- 
tion system in the corresponding wavelength ranges, 
and related effects [48,65]. Note that E defined in this 
way is a randomly distributed quantity, see below. For 
the true transfer efficiency, the average photon count 
rates from donor and acceptor have to be used instead 
of nA and no-) From a typical measurement (of minutes 
to hours, depending on the statistics required), thou- 
sands of such events are acquired and used to construct 
histograms (Figure 2d) that represent the distribution of 
transfer efficiencies present in the sample. 

Figure 2d illustrates such a histogram in terms of a 
simple example. A small FRET-labelled protein is investi- 
gated under conditions where both the folded and the 
unfolded state are populated at equilibrium (in this exam- 
ple at 1.5 M of the denaturant guanidinium chloride, 
GdmCl). In the folded structure, donor and acceptor are 
in close proximity, resulting in high transfer efficiency 
values; in the unfolded state, the average distance 
between the fluorophores is much greater, and the trans- 
fer efficiency is thus expected to be lower. Two corre- 
sponding peaks are observed in the transfer efficiency 
histogram. This aspect already illustrates one of the key 
strengths of single molecule spectroscopy: the two subpo- 
pulations present in the sample can be separated and 
thus investigated individually even if they coexist. The 
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corresponding ensemble experiment would only provide 
the average over the entire transfer efficiency distribution, 
and extracting information about the subpopulations 
would be much more difficult or even impossible. An 
additional peak is usually observed at a transfer efficiency 
of zero. This "donor-only" peak results from molecules 
that lack an active acceptor, either owing to imperfections 
of the labelling procedure or because the acceptor dye 
was photobleached in a previous passage through the 
confocal volume. This simple analysis provides a first 
impression of the heterogeneity of the sample and the 
number of subpopulations present. 

However, with the availability of the other parameters 
mentioned above, we can also proceed from one-dimen- 
sional to higher-dimensional data analysis. Figure 2d 
shows an example where the relative fluorescence life- 
time of the donor is plotted versus the transfer effi- 
ciency in a 2D histogram. The donor fluorescence 
lifetime provides an independent way of determining the 
transfer efficiency that is less susceptible to artefacts 
(caused e.g. by fluorescence quenching) than the ratio- 
metric measurement [63]. In the simplest case of a sin- 
gle fixed distance, the fluorescence lifetime of the donor 
in the presence (tda) and absence (td) of the acceptor 
can be related to the transfer efficiency by [34] 



(If distance distributions are present in the molecules, 
this simple relationship no longer holds, and the experi- 
mentally observed deviations can be used to infer infor- 
mation about the distance distribution and the 
underlying dynamics [63,66], see Figure 2d.) Similarly, 
plots involving fluorescence anisotropics can be used to 
assess the rotational mobility of the dyes and judge 
whether orientational averaging of the chromophores 
can be assumed, which greatly simplifies the FRET ana- 
lysis, or whether their relative orientational distribution 
has to be taken into account explicitly [34,67]. All of 
these parameters can be calculated from the signal of 
each individual molecule [64], be it in free diffusion 
experiments or in measurements on immobilized sam- 
ples. Note, however, that determining quantities such as 
the transfer efficiency or the fluorescence lifetime from 
~100 photons is intrinsically uncertain, as will be dis- 
cussed below. This uncertainty can be reduced by sub- 
population-specific analysis. If, e.g., all photons from 
bursts originating from unfolded molecules are com- 
bined, fluorescence intensity decays for the unfolded 
state can be extracted with much better signal-to-noise 
ratio (Figure 2e). Similarly, subpopulation-specific corre- 
lation functions can be extracted to investigate the 
dynamics of the molecules in a specific state or confor- 
mation (Figure 2e). 



Variations on the theme 

The procedures outlined so far represent a common 
approach in advanced single-molecule fluorescence 
spectroscopy, and suitable four-channel instruments are 
commercially available [68]. However, there are many 
variants of the type of instrumentation and analysis 
methods, which cannot be discussed here in detail. 
Instead I will mention some of them briefly and refer 
the reader to examples from the relevant literature. 

For basic applications, simpler instruments with only 
two detection channels and continuous-wave laser exci- 
tation are very commonly employed. Such instruments 
are sufficient to measure ratiometric transfer efficiencies, 
but they lack the possibility of extending the analysis to 
fluorescence lifetimes or anisotropics at the same time. 
Another possibility available with more than two detec- 
tions channels is to sort the photons by more than two 
wavelength bands [63]. A powerful application is the use 
of multicolour-FRET, where more than two fluoro- 
phores are used, which in principle allows three and 
more distances to be determined simultaneously 
[69-71]. A common technique is the use of multiple 
lasers for probing both donor and acceptor dye [72,73]. 
By rapidly alternating donor and acceptor excitation on 
the nanosecond to microsecond timescale, donor and 
acceptor dye can be probed independently, and mole- 
cules with low transfer efficiency can be distinguished 
from "donor-only" events. Additionally, the stoichiome- 
trics of donor to acceptor dyes present in the molecules 
can be quantified, which is often of interest for intermo- 
lecular interactions [56,57]. The types of lasers employed 
also vary. For some experiments, the use of pulsed lasers 
is advantageous because fluorescence lifetimes can be 
determined. For the measurement of sub-microsecond 
dynamics with correlation spectroscopy (see below), 
however, continuous wave lasers are employed. The 
choice of fluorophores will define the wavelengths 
required for excitation. 

A popular setup complementary to confocal detection 
is wide-field microscopy with two-dimensional detectors 
such as sensitive CCD cameras, often in combination 
with total internal reflection fluorescence (TIRF) [1,45]. 
Wide-field imaging has the big advantage that it allows 
the collection of data from many single molecules in par- 
allel. However, it requires their immobilization at the sur- 
face for excitation by the shallow evanescent field that is 
generated by the reflected laser beam. The time resolu- 
tion of wide-field methods is currently limited to the 
millisecond range by the frame-transfer rates of cameras 
with sufficiently high detection efficiency. This approach 
is thus most commonly applied to the investigation of 
processes on timescales of seconds to minutes. Note that 
confocal detection also allows the interrogation of 
surface-immobilized molecules, but the molecules have to 



Schu\er Journal of Nanobiotechnology 2Q^3, 11(Suppl 1):S2 
http://www.jnanobiotechnology.eom/content/1 1 /SI /S2 



Page 7 of 1 7 



be probed serially by sample or laser scanning, which 
often complicates experiments under non-equilibrium 
conditions. 

Finally, a wide variety of data analysis methods beyond 
the basic set of commonly used procedures are 
employed. These range from different strategies for 
burst identification (e.g. binning with fixed time win- 
dows or based on inter-photon times [60]), correction 
for background, direct excitation, and crosstalk between 
the detection channels [48,63], to a wide range of more 
sophisticated methods that take into account the rela- 
tionship between all recorded parameters [63] and the 
details of photon statistics [18,74-78]. 

Photon statistics and FRET efficiency distributions 

A detailed treatment of the role of photon statistics for 
the analysis of single-molecule data is beyond the scope 
of this basic tutorial, but a few aspects are essential to 
keep in mind. First, it is important to recognize that 
even for a molecule with a single fixed distance, the 
resulting FRET efficiency histogram is broadened. A 
fundamental source of broadening is shot noise, the var- 
iation in count rates about their mean values due to the 
discrete nature of the signal. In other words, we observe 
a statistical distribution of FRET efficiencies since only 
small numbers of photons are detected from an indivi- 
dual molecule. The variance of the corresponding trans- 
fer efficiency distribution due to shot noise can be 
calculated as [79] 

(7/„ = (£2) - {E)^ = {£)(!- {£)) (1/N) <{£)(!- {£))/N,-, (7) 

where (£) is the true underlying mean transfer effi- 
ciency, (l/N) is the average of the inverse numbers of 
photons per burst, and is the minimum burst size, i. 
e. the threshold used for data analysis. In other words, 
the observation of a broadened transfer efficiency histo- 
gram does not necessarily imply the existence of a 
broad underlying distance distribution. Similar consid- 
erations lead to broad distributions of fluorescence life- 
times or anisotropies from observations involving a 
small number of photons. 

Since the contribution of shot noise to the measured 
transfer efficiency distributions is well understood, rigor- 
ous methods are available to fit the measured transfer 
efficiency histograms directly assuming an underlying, 
shot noise-free, transfer efficiency distribution [74,79-82]. 
In the case of a single fixed distance (or dynamics much 
faster than the interphoton time, see below), this resulting 
distribution should be close to a delta function. In prac- 
tice, histograms broader than expected from shot noise 
alone are commonly observed. The origin of this excess 
width is often difficult to establish unequivocally [17,83], 
but there are factors besides conformational heterogeneity 
that can contribute, such as variations in fluorescence 



quantum efficiencies, labelling permutations, or an imper- 
fect alignment of the confocal volumes for donor and 
acceptor channels. Consequently, without a suitable refer- 
ence, it is often difficult to interpret a width in excess of 
shot noise in terms of slow conformational dynamics. 
Multiparameter fluorescence detection is very helpful for 
identifying such complications and taking them into 
account for the analysis [63]. 

The most interesting cause of broadening of the trans- 
fer efficiency distribution is of course the presence of a 
distance distribution in the sample. In some cases, this 
heterogeneity is quite obvious, as in the case of folded 
and unfolded molecules shown in Figure 2d. However, 
take for example the subpopulation of unfolded mole- 
cules itself: clearly, a broad distribution of intramolecu- 
lar distances will be present in the unfolded state, and 
yet, the peak corresponding to unfolded molecules is 
not much broader than that of the folded state. As in 
many other spectroscopic techniques, a crucial aspect 
that needs to be taken into account are the timescales 
involved in the measurement and the dynamics of the 
molecular system. 

Distance distributions and relevant timescales 

The relative magnitudes of the timescales of at least four 
different processes have an influence on the position 
and the width of the FRET efficiency histogram: (a) the 
rotational correlation time of the chromophores, (b) the 
fluorescence lifetime of the donor, (c) the intramolecular 
dynamics of the molecule probed by the fluorophores, 
and (d) the observation timescale. 

The rotational correlation time of the chromophores 
influences the value of the orientation factor k (Eq. 3): 
if dye reorientation is sufficiently fast for the relative 
orientation of the donor and acceptor dipoles to average 
out while the donor is in the excited state, can be 
assumed to equal 2/3. If, in the other extreme, the 
donor fluorescence decay is much faster than dye reor- 
ientation, a static distribution of relative dye orientations 
can be assumed. Intermediate cases are difficult to treat 
analytically [34], and simulations become the method of 
choice. K = 2/3 is often a good approximation because 
the rotational correlation times of typical dyes are in the 
range of a few hundred picoseconds (if there is no per- 
sistent interaction with the protein), while their fluores- 
cence lifetimes are in the nanosecond range (although 
they may be reduced significantly by the transfer pro- 
cess), but this aspect must be verified for every molecu- 
lar system under investigation. 

Most importantly, the timescale of interdye distance 
dynamics relative to the observation time (more accu- 
rately, the inter-photon times [84]) will affect the width of 
the measured transfer efficiency distributions. As shown 
by Gopich and Szabo [80,84], the observation time must 
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be approximately an order of magnitude smaller than the 
relaxation time of the donor-acceptor distance to obtain 
physically meaningful distance distributions or corre- 
sponding potentials of mean force from transfer efficiency 
histograms. Otherwise, only the mean value of the transfer 
efficiency of the respective subpopulation can be used to 
extract information about the distance distribution, and an 
independent model for the shape of the distance distribu- 
tion is needed. In practice, this means that distance distri- 
butions can be determined from FRET efficiency 
histograms from free diffusion experiments if the underly- 
ing dynamics are on a timescale greater than about 1 ms, 
assuming photon count rates of -10 s' typically achieved 
during fluorescence bursts [84]. A noticeable influence of 
dynamics on the width of the transfer efficiency histo- 
grams, however, is already expected for fluctuations in the 
10 to 100 i^s timescale [17,84]. As a result, reaction 
dynamics can be obtained from the shape of transfer effi- 
ciency histograms [19,85]. Figure 3 illustrates how changes 
in the interconversion kinetics between two populations 
can influence the shape of FRET efficiency histograms and 
how their analysis can be used to determine the underly- 
ing exchange kinetics [20,76]. 

Given a distance distribution, P{r), three physically 
plausible limits for the possible averaging regimes and 
the resulting mean transfer efficiencies, (£), are [86,87]: 

1. If the rotational correlation time, !„ of the chromo- 
phores is small relative to the fluorescence lifetime, r^, 
of the donor (i.e. k = 2/3), and the dynamics of the 
polypeptide chain (with relaxation time Zr) are slow rela- 
tive to Td, 



J a 



£(r)P(r)dr with E(r) = (l + (r/Ro)'^) \ 



(8) 



where P{r) is the normalized inter-dye distance distri- 
bution; a is the distance of closest approach of the dyes; 
and 4 is the contour length of the peptide. This scenario 
typically applies to cases such as unfolded or intrinsi- 
cally disordered proteins [12,17,88]. 

2. If « and « r^. 



f;{Rolrfp{r)dr 
l+f;{Rolrfp{r)dr 



(9) 



This dynamic averaging regime often applies to the 
local motion of the dyes on their linkers or in short 
peptides [87]. 

3. If » rjj and » r^. 

This limit would correspond to a situation where both 
the relative orientations of the fluorophores and the dis- 
tance distribution are essentially static on the observation 



timescale. In such a scenario, extreme broadening of the 
transfer efficiency distributions can result, with contribu- 
tions from both orientational and distance effects on the 
FRET efficiency [34,67]. The theoretical isotropic prob- 
ability density of n for the case in which all orientations 
of the donor and acceptor transition dipoles are equally 
probable is [34,89] 



= In 



-. ln(2 + V3) 



2 + V3 



^ + V<c^ - 1 



0 < /f^ < 1 



1 < /f^ < 4 



(11) 



Another interesting consequence of the limiting cases 
above can be illustrated with the example of an 
unfolded protein (Figure 2). As will be shown in detail 
below, the inter-dye distance dynamics in this case are 
much faster (~100 ns) than the millisecond observation 
time and the average interphoton time relevant for the 
transfer efficiency histogram. As a result, the width of 
the histogram does not report on the distance distribu- 
tion, but only on its mean value - the information on 
the shape of P{r) is lost. The fluorescence lifetime (~1 
ns), however, is much shorter than the timescale of 
chain dynamics. In other words, the chain is essentially 
static during an excitation/ emission cycle of the fluoro- 
phores, and the distribution of transfer rates resulting 
from the broad inter-dye distance distribution will give 
rise to highly non-exponential fluorescence decays, 
which contain information on the shape of P{r) [37]. 
From single-molecule FRET measurements, these decays 
can be determined for individual subpopulations to 
remove contributions from other species (Figure 2d, e) 
and can be analysed in detail in terms of distance distri- 
butions [88,90]. As a result, the presence or absence of 
a distance distribution will also affect the position and 
shape of the subpopulation peaks in lifetime vs. transfer 
efficiency 2D-histograms (Figure 2d) [13,66,78]. 

These examples illustrate how the timescales of the 
FRET process and the molecular dynamics influence the 
transfer efficiencies and their distributions observed in 
single-molecule experiments. For a quantitative analysis 
of single-molecule FRET experiments, it is thus essential 
to know the relevant timescales involved. How fluores- 
cence lifetimes and anisotropies can be obtained from 
the measurements was already discussed, and these are 
essential for quantifying Td and and thus for assigning 
the experimental situation in question to the limiting 
cases given above. Another very generally applicable 
method for quantifying the dynamics of the system is 
correlation spectroscopy. 

Correlation analysis 

In ensemble experiments, the most common approach 
for investigating the relaxation kinetics of a reaction is 
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Figure 3 Effects of conformational dynamics on transfer efficiency histograms In single-molecule FRET experiments on freely diffusing 
molecules. The histograms (blue bars) were calculated assuming that the molecules in the sample undergo transitions between two 
conformational states with low (fi = 0.4) and high {E2 = 0.8) transfer efficiency. The histograms are shown for different values of the transition 
rate, given in the upper left corner of the histogram. In the first column, the equilibrium populations of two the states are equal; in the second 
column, the populations differ by a factor of 2. For low values of k, two well-separated peaks are observed, corresponding to the two 
subpopulations. With increasing relaxation rate, it becomes more likely that a conformational transition occurs during a transit of the molecule 
through the confocal volume (and thus a fluorescence burst), resulting in an increasing number of events with apparent transfer efficiencies in 
the range between the true transfer efficiencies f, and E2. At high relaxation rates, transitions occur so frequently during a fluorescence burst 
that the two subpopulations become indistinguishable; only one peak remains, whose relative position between fi and £2 is determined by the 
equilibrium constant. The histograms are compared to the Gaussian approximation [20] (red lines), which can also be used to determine rate 
constants from measured histograms[19]. Figure provided by Irina Gopich [79]. 



to perturb the entire sample, e.g. by a rapid change in 
denaturant concentration in a stopped-flow instrument 
or by a laser-induced temperature jump, and then to 
observe the system return to equilibrium under the 
new set of conditions. The resulting kinetics can 
be analysed in terms of kinetic models to identify the 
underlying molecular mechanisms. According to the 



fluctuation-dissipation theorem [91], the rate of relaxation 
of a system to equilibrium after a small macroscopic per- 
turbation and the time correlation of spontaneous fluctua- 
tions of the undisturbed system at equilibrium are 
described by the same rate coefficients. Single-molecule 
spectroscopy allows us to detect such spontaneous fluctua- 
tions at equilibrium, and correlation analysis is a versatile 
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Figure 4 Quantifying translational diffusion with FCS. An FCS curve of fluorescently labelled ribosomes freely diffusing in solution (blue) is 
shown with a fit using Eq. 1 3 (omitting the exponential term; data by J. Clark & B. Schuler, unpublished). From the amplitude of the correlation 
function, the average number of fluorescent particles in the confocal volume, (N), can be determined. With knowledge of the size of the 
confocal volume, M, based on its half-width, m^y, and height, cu^, the particle concentration can be calculated. From the diffusion time, tq, the 
translational diffusion coefficient, D, of the particles is obtained, which can be related to the Stokes radius, r^, via the Stokes-Einstein relation 
(where kg is Boltzmann's constant, J is temperature, and r\ is the solution viscosity). The inset shows a structural representation of the ribosome 
(fj = 12 nm). 



approach for quantifying the dynamics of chemical reac- 
tions or conformational changes in the absence of 
perturbations. 

For instance, the number of molecules present in a 
confocal volume will fluctuate since molecules continu- 
ously enter and leave the observation region by diffu- 
sion. Similarly, if we consider the example of a folding 
protein, each molecule will stochastically jump between 
folded and unfolded states. Both processes will lead to 
fluctuations in the fluorescence count rates. Powerful 
tools for analysing such fluctuations are correlation 
functions [92]. The autocorrelation function of the 
property A, for instance, is defined as 

1 ^ 

(A (t) A (t -E t)> = lim - / A (t) A (t -E r) dt. (12) 

r^>oo T Q 

(Strictly speaking, this definition is only true for an 
ergodic system, where the averaging is independent of 
the starting time t, and in any experimental measure- 
ment the averaging is of course done over finite time.) 



Crosscorrelation functions between different proper- 
ties or signals, e.g. fluorescence emission from a donor 
and an acceptor chromophores undergoing FRET, can 
be defined analogously. The autocorrelation function 
of a non-conserved, non-periodic property decays 
from its initial value {A ) to the final value (A) with a 
time constant characteristic of the fluctuation of A, 
where A(t) and A{t+T) are expected to become uncor- 
related at long times. In many cases, the autocorrela- 
tion function decays like a single exponential with a 
characteristic relaxation time or correlation time of 
the property, but often it takes a more complicated 
functional form. 

Fluorescence correlation spectroscopy (FCS) 

A common example for a fluorescence correlation 
experiment [93] is the measurement of translational dif- 
fusion, where fluorescently labelled molecules diffuse 
through a confocal volume approximated by a three- 
dimensional Gaussian shape. The resulting intensity 
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Figure 5 Correlation spectroscopy can be used to probe a wide range of timescales and processes. Illustration of processes that can 
contribute to the fluorescence correlation functions of freely diffusing molecules, with their characteristic timescales and corresponding 
amplitudes. Illustrated are: photon antibunching [1 12] in the range of the fluorescence lifetimes of the fluorophores, rotational diffusion with a 
correlation time r,of, and interdye distance dynamics due to FRET with a reconfiguration time r, (note that r,o, and i, can be in a similar range, 
but can be distinguished by the donor-acceptor crosscorrelation (Figure 6), which shows anticorrelated behaviour in the case of distance 
dynamics); triplet state blinking on a timescale zf, and translational diffusion (Figure 4) with a diffusion time z^iff. Timescales much greater than 
ZdiffSte not accessible with freely diffusing molecules, but can be extended by taking advantage of recurrence effects [18] (i.e. molecules 
returning to the confocal volume) or with experiments on immobilized molecules (Figure 7). 



autocorrelation function normalised by the mean inten- 
sity squared is [27] 

G(z) = l + — + '(i + —LJ] '"(l+Ke-'M, (13) 

(N) V V "''W/ ^ ^ ^ 

where (N) is the average number of molecules in the 
observed volume; r^,y is the characteristic time it takes a 
molecule to diffuse through the observation volume; oj is 
the aspect ratio of the volume; and K is the equilibrium 
constant of a reaction with a relaxation time resulting 
in fluctuations of the emission intensity. In this case, 
there are thus two mechanisms contributing to the 
observed intensity fluctuations: diffusion of molecules in 
and out of the confocal volume, and fluctuations in the 



fluorescence rates caused by the reaction, which could, 
for example, be a protein folding reaction. (From Eq. 13, 
it is obvious that the observation of reaction dynamics is 
limited to timescales not much greater than the diffusion 
timescale, because the diffusive terms will decay to zero 
for T » Tjiff.) Fits of corresponding experiments with Eq. 
13 can thus be used to determine several parameters: 
translational diffusion coefficients from Tdyf and the radius 
of the confocal volume (Figure 4); the concentration of 
fluorescent molecules from (N) and the size of confocal 
volume (Figure 4); and the equilibrium constant and 
relaxation dynamics of the reaction. Note that for FCS 
experiments, the sample concentration is typically chosen 
in the nanomolar range to optimize the signal-to-noise 
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Figure 6 Nanosecond correlation spectroscopy used for the 
determination of unfolded state dynamics on the sub- 
microsecond timescale. (top) Cartoon of a FRET-labelled unfolded 
protein undergoing diffusive chain dynamics, resulting in 
fluctuations of the inter-dye distance, r(fj [12]. (bottom) Donor- 
donor (DD), acceptor-acceptor (AA), and donor-acceptor (DA) 
intensity correlation functions for an unfolded protein are shown 
(CspFm in 4 M GdmCI [96]) in the nanosecond range, where the 
decay at about 50 ns reports on the reconfiguration time of the 
polypeptide chain. Note that the donor-acceptor crosscorrelation 
shows anticorrelated behaviour, a signature of distance dynamics. 
Antibunching [1 12] occurs on the timescale of a few nanoseconds. 
Fits are shown as solid lines [95]. 



ratio [27]. In this concentration range, a separation of 
fluorescence bursts from background signal is no longer 
possible with usual confocal instruments; however, the 
same analysis can of course be applied to picomolar solu- 
tions (Figure 2c, e) and thus be combined with the 
single-molecule methods described above. 

More generally speaking, any process that leads to fluc- 
tuations in fluorescence count rates on an accessible time- 
scale will contribute to the shape of the correlation 



function (Figure 5). Besides translational diffusion and 
reaction dynamics, such processes can be of photophysical 
origin (e.g. triplet state blinking in the microsecond range 
or photon antibunching [94] resulting from the property 
of a single quantum system that it cannot emit two 
photons at the same time, and which thus occurs in the 
range of the fluorescence lifetime); they can result from 
rotational diffusion (typically on the 1 to 100 ns timescale); 
or from molecular interactions between different fluores- 
cently labelled species, to name a few examples. 

Of particular interest for the investigation of biomole- 
cules are fluorescence fluctuations caused by conforma- 
tional dynamics. An example that illustrates the 
possibility to extract very rapid dynamics from correla- 
tion spectroscopy is given in Figure 6. Consider an 
unfolded polypeptide chain labelled with a FRET pair. 
In solution, the chain will show rapid diffusive intramo- 
lecular dynamics, and as a result, the distance between 
donor and acceptor will fluctuate. Owing to the FRET 
coupling between them, donor and acceptor emission 
rates will fluctuate on the same timescale. This timescale 
characterises the intramolecular dynamics of the chain 
and closely corresponds to its reconfiguration time [95]. 
By analysing the donor and acceptor intensity correla- 
tion functions, the reconfiguration time can be quanti- 
fied; for unfolded and intrinsically disordered proteins it 
turns out to be in the range of -100 ns [12,13,96-98], 
close to the reconfiguration times expected from simple 
polymer dynamics [99], but with clear indications for 
the presence of "internal friction", i.e. nonspecific inter- 
actions within the chain that slow down the dynamics 
and depend on the compactness of the polypeptide 
chain [13,97]. 

Correlation analysis of fluorescent molecules freely 
diffusing in solution, as in typical FCS experiments, can 
provide information on dynamics from nanoseconds to 
milliseconds. However, correlation analysis can equally 
be applied to the signal recorded from single immobi- 
lised molecules to extend the range of accessible times. 
For immobilized molecules, the observation time is lim- 
ited only by photobleaching, and with suitable photo- 
protective solution additives [100], observation times of 
minutes can be achieved [1]. The bleaching time will 
critically depend on the laser power and thus the excita- 
tion rates employed. Every experiment will thus have to 
be optimized with respect to the compromise between 
time resolution - determined by the number of photons 
detected per time - and the total observation time 
required to probe the dynamic processes of interest. 

Kinetics and thermodynamics from single-molecule 
fluorescence trajectories 

In many cases, dynamics can be determined from single- 
molecule experiments without having to resort to 
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Figure 7 Kinetics from single-molecule fluorescence trajectories and single-photon time series (a) A representation of a ribosome 
structure witii different structural features and labelling sites is shown on the left. On the right, a single-molecule fluorescence trajectory (green, 
donor; red, acceptor) and resulting FRET trajectory (blue) obtained using a wide-field TIRF imaging system at 40 ms time resolution under 
equilibrium conditions from a surface-immobilized ribosome bearing site-specifically labelled A-site and P-site tRNA molecules [1 13]. The 
dynamic FRET data, reporting directly on thermally accessible, nanometre-scale changes in tRNA position within the ribosome, persist until donor 
and/or acceptor fluorophore photobleach. Figure from ref [114]. (b) On the left, a photon time series (green, donor; red, acceptor) from a 
confocal measurement using single-photon counting is shown. A state trajectory obtained from the Viterbi algorithm is shown as a blue line, 
with segments corresponding to folded and unfolded protein, and transition points [19]. On the right, a cartoon of the structure of the small 
protein a-jD immobilized on a surface is shown, whose folding was investigated here. Figure adapted from ref [19]. 



correlation analysis. The most prominent examples 
involve dynamics on the timescale of seconds that are 
visible directly in fluorescence recordings of immobi- 
lized molecules, e.g. as jumps between two or more 
levels of transfer efficiencies, corresponding to different 
conformations of a molecule [1] (Figure 7a). With data 
of sufficient quality, the kinetics of the system can be 
analysed directly from the distributions of dwell times 
in the individual states, much alike the methods estab- 
lished in the field of single-channel recording [101]. In 
the simplest case of a two-state reaction, A <-> B, the 
rate coefficients of interconversion can be obtained 
from the inverse values of the average dwell times in 
states A and B, respectively, and the equilibrium con- 
stant is given by the ratio of the average dell times 
[101]. In cases where the transitions are difficult to 
identify, hidden Markov models [25,102,103] or maxi- 
mum likelihood methods [22,75,103] can be essential 
for a quantitative analysis in terms of kinetic models 
(Figure 7b). 

Nonequlllbrium dynamics of single molecules 

Even though kinetic information can often be obtained 
from equilibrium single-molecule experiments, in many 
cases it is still essential to probe nonequilibrium 



dynamics, especially if the reaction of interest is essen- 
tially irreversible during the observation time accessible 
at equilibrium. A method that lends itself very well to 
the combination with single-molecule detection optics 
is microfluidic mixing, and several different implemen- 
tations have been reported [104-109]. The basic idea of 
these devices is to mix solutions in stationary laminar 
flow by reducing the dimensions of the channels in 
order that the components of the solutions that are 
combined exchange very quickly, solely by diffusion 
[110]. After mixing, the confocal observation volume is 
placed at different points in the observation channel, 
corresponding to different times after the start of the 
reaction (Figure 8), with dead times in the millisecond 
range [109] and even below [107]. Microfluidic mixing 
can be used, e.g., to rapidly change solution conditions 
or to investigate the kinetics of protein-protein 
interactions [104-107,111] and has thus become a 
valuable extension of the growing single-molecule 
toolbox. 

Conclusions 

I hope that this brief introductory tutorial will stimulate the 
reader's appetite for single-molecule spectroscopy and will 
facilitate the exploration of the more specialized literature. 
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Figure 8 Microfluidics for probing nonequilibrium dynamics with single-molecule spectroscopy The millisecond unfolding kinetics of a 
small protein (BdpA) upon mixing with denaturant are shown as an example for the application of microfluidics [1 15]. Folded BdpA in the 
centre inlet was mixed with 3.3 M of the denaturant GdmCI from the side inlet channels to trigger unfolding (see inset for an electron 
micrograph of the microfluidic structure in the mixing region of the device), (a) The fraction of folded population (blue data points) was 
determined from fits to the corresponding transfer efficiency histograms (solid lines in b-d) and plotted as a function of time after mixing. The 
data were fit with a single exponential decay (dashed red line), (b-d) Representative transfer efficiency histograms measured at the positions 
indicated in (a). Figure adapted from ref [115]. 



The rapidly growing number of applications of single- 
molecule spectroscopy to biomolecular systems now covers 
an impressive range of advanced techniques and topics, 
many of which could not even be mentioned here. Further- 
more, even though single-molecule spectroscopy has 
reached a certain level of maturity, new techniques and 
analysis methods are still emerging on a regular basis, and 
the scope of single-molecule methods thus keeps 
expanding. 
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FRET: Forster resonance energy transfer; FCS: Fluorescence correlation 
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